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atttaggtga 


cactatagaa 


ctcacctatc 


tccccaacac 


ctaataacat 


tcaatcactc 


60 


tttccactaa 


ccacctatct 


acatcaccaa 


gatatcacta 


gttctcgaga 


tgagcggctt 


120 


ccgcaagatg 


gccttcccca 


gcggcaaggt 


cgagggctgc 


atggtgcagg 


tcacctgcgg 


180 


caccactacc 


ctgaacggcc 


tgtggctgga 


tgacaccgtc 


tactgccccc 


gccacgtgat 


240 


ctgcaccgcc 


gaggacatgc 


tgaaccccaa 


ctacgaggac 


ctgctcatcc 


gcaagagcaa 


300 


ccactccttc 


ctggtgcagg 


ccggcaacgt 


ccagctgcgc 


gtgatcggcc 


acagcatgca 


360 


gaactgcctg 


ctccgcctga 


aggtggacac 


cagcaacccc 


aagaccccca 


agtacaagtt 


420 


cgtgcgcatc 


cagcccggcc 


agaccttcag 


cgtgctggcc 


tgctacaacg 


gcagccccag 


480 


cggcgtgtac 


cagtgcgcca 


tgcgccccaa 


ccacaccatc 


aagggcagct 


tcctgaacgg 


540 


gagctgcggc 


agcgtgggct 


tcaacatcga 


ctacgactgc 


gtaagcttct 


gctacatgca 


600 


ccacatggag 


ctgcccaccg 


gcgtgcacgc 


cggcaccgac 


ctggagggca 


agttctacgg 


660 


ccccttcgtg 


gaccgccaga 


ccgcccaggc 


cgccggcacc 


gacaccacca 


tcaccctgaa 


720 


cgtgctggcc 


tggctgtacg 


ccgccgtgat 


caacggcgac 


cgctggttcc 


tgaaccgctt 


780 


caccactacc 


ctgaacgact 


tcaacctggt 


ggccatgaag 


tacaactacg 


agcccctgac 


840 


ccaggaccac 


gtggacatcc 


tgggccccct 


gagcgcccag 


accggcatcg 


ccgtcctgga 


900 


catgtgcgcc 


gccctgaagg 


agctgctcca 


gaacggcatg 


aacggccgca 


ccatcctggg 


960 


cagcaccatc 


ctggaggacg 


agttcacccc 


cttcgacgtc 


gtgcgccagt 


gcagcggcgt 


1020 


gaccttccag 


taaggatcca 


tatatagggc 


ccgggttata 
i 


attacctcag 


gtcgacgtcc 


1080 



catggttttg 


tatagaattt 


acggctagcg 


ccggatgcga 


cgccggtcgc 


gtcttatccg 


1140 


gccttcctat 


atcaggctgt 


gtttaagacg 


ccgccgcttc 


gcccaaatcc 


ttatgccggt 


1200 


tcgacggctg 


gacaaaatac 


tgtttatctt 


cccagcgcag 


gcaggttaat 


gtaccacccc 


1260 


agcagcagcc 


ggtatccagc 


gcgtatatac 


cttccggcgt 


acctttgccc 


tccagcgatg 


1320 


cccagtgacc 


aaaggcgatg 


ctgtattctt 


cagcgacagg 


gccaggaatc 


gcaaaccacg 


1380 


gtttcagtgg 


ggcaggggcc 


tcttccggcg 


attcttacta 


gctagtatgc 


ataggtgctg 


1440 


aaatataaag 


tttgtgtttc 


taaaacacac 


gtggtacgta 


cgataacgta 


cagtgttttt 


1500 


ccctccactt 


aaatcgaagg 


gtagtgtctt 


ggagcgcgcg 


gagtaaacat 


atatggttca 


1560 


tatatgtccg 


taggcacgta 


aaaaaagcga 


gggattcgaa 


ttcccccgga 


acccccggtt 


1620 


ggggcccacg 


cctcgatcga 


gcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaagctt 


1680 


ggcgtaatca 


tggtcatagc 


tgtttcctgt 


gtgaaattgt 


tatccgctca 


caattccaca 


1740 


caacatacga 


gccggaagca 


taaagtgtaa 


agcctggggt 


gcctaatgag 


tgagctaact 


1800 


cacattaatt 


gcgttgcgct 


cactgcccgc 


tttccagtcg 


ggaaacctgt 


cgtgccagct 


1860 


gcattaatga 


atcggccaac 


gcgcggggag 


aggcggtttg 


cgtattgggc 


gctcttccgc 


1920 


ttcctcgctc 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


1980 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


2040 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


2100 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


2160 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


2220 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


2280 


gctttctcat 


agctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


2340 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


2400 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


2460 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


2520 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


2580 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


2640 


tgtttgcaag 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


2700 


ttctacgggg 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


2760 


attatcaaaa 


aggatcttca 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


2820 


ctaaagtata 


tatgagtaaa 


cttggtctga 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


2880 


tatctcagcg 


atctgtctat 


ttcgttcatc 


catagttgcc 


tgactccccg 


tcgtgtagat 


2940 


aactacgata 


cgggagggct 


taccatctgg 


ccccagtgct 
2 


gcaatgatac 


cgcgagaccc 


3000 



acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 3060 

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 3120 

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 3180 

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 3240 

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 3300 

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 3360 

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 3420 

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 3480 

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 3540 

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 3600 

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 3660 

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 3720 

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 3780 

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 3840 

acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac 3900 

gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 3960 

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 4020 

cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat 4080 

tgtactgaga gtgcaccata tcgacgctct cccttatgcg actcctgcat taggaagcag 4140 

cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc atgcaaggag 4200 

atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc gaaacaagcg 4260 

ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg 4320 

ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc gtagaggatc 4380 

tggctagcga tgaccctgct gattggttcg ctgaccattt ccggggtgcg gaacggcgtt 4440 

accagaaact cagaaggttc gtccaaccaa accgactctg acggcagttt acgagagaga 4500 

tgatagggtc tgcttcagta agccagatgc tacacaatta ggcttgtaca tattgtcgtt 4560 

agaacgcggc tacaattaat acataacctt atgtatcata cacatacg 4608 

<210> 2 
<211> 6389 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid 
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<400> 2 
atttaggtga 


cactatagaa 


ctcacctatc 


tccccaacac 


ctaataacat 


tcaatcactc 


60 


tttccactaa 


ccacctatct 


acatcaccaa 


gatatcactc 


gagaatggtg 


agcaagggcg 


120 


aggagctgtt 


caccggggtg 


gtgcccatcc 


tggtcgagct 


ggacggcgac 


gtgaacggcc 


180 


acaagttcag 


cgtgtccggc 


gagggcgagg 


gcgatgccac 


ctacggcaag 


ctgaccctga 


240 


agttcatctg 


caccaccggc 


aagctgcccg 


tgccctggcc 


caccctcgtg 


accaccttca 


300 


cctacggcgt 


gcagtgcttc 


agccgctacc 


ccgaccacat 


gaagcagcac 


gacttcttca 


360 


agtccgccat 


gcccgaaggc 


tacgtccagg 


agcgcaccat 


cttcttcaag 


gacgacggca 


420 


actacaagac 


ccgcgccgag 


gtgaagttcg 


agggcgacac 


cctggtgaac 


cgcatcgagc 


480 


tgaagggcat 


cgacttcaag 


gaggacggca 


acatcctggg 


gcacaagctg 


gagtacaact 


540 


acaacagcca 


caacgtctat 


atcatggccg 


acaagcagaa 


gaacggcatc 


aaggtgaact 


600 


tcaagatccg 


ccacaacatc 


gaggacggca 


gcgtgcagct 


cgccgaccac 


taccagcaga 


660 


acacccccat 


cggcgacggc 


cccgtgctgc 


tgcccgacaa 


ccactacctg 


agcacccagt 


720 


ccgccctgag 


caaagacccc 


aacgagaagc 


gcgatcacat 


ggtcctgctg 


gagttcgtga 


780 


ccgccgccgg 


gatcactcac 


ggcatggacg 


agctgtacaa 


gcccccccag 


accagcatca 


840 


cctctgccgt 


gctgcagagc 


ggcttccgca 


agatggcctt 


ccccagcggc 


aaggtgatgt 


900 


tacgtcctgt 


agaaacccca 


acccgtgaaa 


tcaaaaaact 


cgacggcctg 


tgggcattca 


960 


gtctggatcg 


cgaaaactgt 


ggaattgatc 


agcgttggtg 


ggaaagcgcg 


ttacaagaaa 


1020 


gccgggcaat 


tgctgtgcca 


ggcagtttta 


acgatcagtt 


cgccgatgca 


gatattcgta 


1080 


attatgcggg 


caacgtctgg 


tatcagcgcg 


aagtctttat 


accgaaaggt 


tgggcaggcc 


1140 


agcgtatcgt 


gctgcgtttc 


gatgcggtca 


ctcattacgg 


caaagtgtgg 


gtcaataatc 


1200 


aggaagtgat 


ggagcatcag 


ggcggctata 


cgccatttga 


agccgatgtc 


acgccgtatg 


1260 


ttattgccgg 


gaaaagtgta 


cgtatcaccg 


tttgtgtgaa 


caacgaactg 


aactggcaga 


1320 


ctatcccgcc 


gggaatggtg 


attaccgacg 


aaaacggcaa 


gaaaaagcag 


tcttacttcc 


1380 


atgatttctt 


taactatgcc 


ggaatccatc 


gcagcgtaat 


gctctacacc 


acgccgaaca 


1440 


cctgggtgga 


cgatatcacc 


gtggtgacgc 


atgtcgcgca 


agactgtaac 


cacgcgtctg 


1500 


ttgactggca 


ggtggtggcc 


aatggtgatg 


tcagcgttga 


actgcgtgat 


gcggatcaac 


1560 


aggtggttgc 


aactggacaa 


ggcactagcg 


ggactttgca 


agtggtgaat 


ccgcacctct 


1620 


ggcaaccggg 


tgaaggttat 


ctctatgaac 


tgtgcgtcac 


agccaaaagc 


cagacagagt 


1680 


gtgatatcta 


cccgcttcgc 


gtcggcatcc 


ggtcagtggc 


agtgaagggc 


gaacagttcc 


1740 


tgattaacca 


caaaccgttc 


tactttactg 


gctttggtcg 


tcatgaagat 


gcggacttgc 


1800 


gtggcaaagg 


attcgataac 


gtgctgatgg 


tgcacgacca 


cgcattaatg 


gactggattg 


1860 


gggccaactc 


ctaccgtacc 


tcgcattacc 


cttacgctga 
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agagatgctc 


gactgggcag 


1920 



atgaacatgg 


catcgtggtg 


attgatgaaa 


ctgctgctgt 


cggctttaac 


ctctctttag 


1980 


gcattggttt 


cgaagcgggc 


aacaagccga 


aagaactgta 


cagcgaagag 


gcagtcaacg 


2040 


gggaaactca 


gcaagcgcac 


ttacaggcga 


ttaaagagct 


gatagcgcgt 


gacaaaaacc 


2100 


acccaagcgt 


aataatQtqci 


agtattgcca 


acgaaccgga 


tacccgtccg 


caaggtgcac 


2160 


gggaatattt 


cgcgccactg 


gcggaagcaa 


cgcgtaaact 


cgacccgacg 


cgtccgatca 


2220 


cctgcgtcaa 


tgtaatgttc 


tgcgacgctc 


acaccgatac 


catcagcgat 


ctctttgatg 


2280 


tgctgtgcct 


gaaccgttat 


tacggatggt 


atgtccaaag 


cggcgatttg 


gaaacggcag 


2340 


agaaggtact 


ggaaaaagaa 


cttctggcct 


ggcaggagaa 


actgcatcag 


ccgattatca 


2400 


tcaccgaa'ta 




acgttagccg 


ggctgcactc 


aatgtacacc 


gacatgtgga 


2460 


gtgaagagtia 


"tcagtgiigca 


"tggctggata 


tgtatcaccg 


cgtctttgat 


cacatcaaca 


2520 


ccgtcgtxgg 


"tgaacaggta 


"tggaa'tttcg 


ccgattttgc 


gacctcgcaa 


ggcatattgc 


2580 


gcgttggcgg 


"Caacaagaaa 


yyy<*<-*- l h-o 


cticgcgaccg 


caaaccgaag 


tcggcggctt 


2640 


"ttctgctgca 


aaaacgctigg 


actggcatga 


acttcggtga 


aaaaccgcag 


caaaaaaaca 
^- a yyy a yy^ a 


2700 


aacaatgaat 


caacaactct 


cctggcgcac 


catcgtcggc 


tacagcctcg 


ggaattgcta 


2760 


ccgagctcgg 


tacctgtccg 


cggtcgcgac 


gtacgcgggc 


ggccgccata 


aattggatcc 


2820 


atatataggg 


cccgggttat 


aattacctca 


ggtcgacgtc 


ccatggtttt 


gtatagaatt 


2880 


tacggctagc 


gccggatgcg 


- - - y y u y 


cgtcttatcc 


ggccttccta 


tatcaggctg 


2940 


tgtttiaagac 


gccgccgctt 


cgcccaaatc 


cttatgccgg 


ttcgacggct 


ggacaaaata 


3000 


ctgtttatct 


tcccagcgca 


aacaaattaa 


tgtaccaccc 


cagcagcagc 


cggtatccag 


3060 


cgcgtatata 


ccttccggcg 


tacctttgcc 


ctccagcgat 


gcccagtgac 


caaaggcgat 


3120 


gctgtattct 


"tcagcgacag 


ggccaggaat 


cgcaaaccac 


ggtttcagtg 


gggcaggggc 


3180 


ctcttccggc 


gat x.cx. "tact 


agctagtatg 


cataggtgct 


gaaatataaa 


gtttgtgttt 


3240 


ctaaaacaca 




acga~taacg*C 


acagtgfttt: 


tccctccact 


taaatcgaag 


3300 


g g "t a g t g t c x. 


Lyyayi~yt_y*_ 


ggag~taaaca 


tataiggtitc 


atatatgtcc 


gtaggcacgt 


3360 


aaaaaaagcg 


agggattcga 


attcccccgg 


aacccccggt 


taaaacccac 


gcctcgatcg 


3420 


agcaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaagct 


tggcgtaatx 


atggtcatag 


3480 


ctgtttcctg 


tgtgaaattg 


ttatccgctc 


acaattccac 


acaacatacg 


agccggaagc 


3540 


ataaagtgta 




tgcctaatga 


gtgagctaac 


tcacattaat 


tgcgttgcgc 


3600 


tcactgcccg 


ctttccagtc 


gggaaacctg 


tcgtgccagc 


tgcattaatg 


aatcggccaa 


3660 


cgcgcgggga 


gaggcggttt 


gcgtattggg 


cgctcttccg 


cttcctcgct 


cactgactcg 


3720 


ctgcgctcgg 


tcgttcggct 


gcggcgagcg 


gtatcagctc 


actcaaaggc 


ggtaatacgg 


3780 


ttatccacag 


aatcagggga 


taacgcagga 


aagaacatgt 
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gagcaaaagg 


ccagcaaaag 


3840 



gCCa.gQ3.3CC 


gtaaaaaggc 


cgcgttgctg 


gcgtttttcc 


ataggctccg 


cccccctgac 


3900 


gagcatcaca 


aaaatcgacg 


ctcaagtcag 


aggtggcgaa 


acccgacagg 


actataaaga 


3960 


taccaggcgt 


ttccccctgg 


aagctccctc 


gtgcgctctc 


ctgttccgac 


cctgccgctt 


4020 


ac egg at ace 


tgtccgcctt 


tctcccttcg 


ggaagcgtgg 


cgctttctca 


tagctcacgc 


4080 


tgtaggtatc 


tcagttcggt 


gtaggtcgtt 


cgctccaagc 


tgggctgtgt 


gcacgaaccc 


4140 


cccg"t"tcagc 


ccgaccgctg 


cgccttatcc 


ggtaactatc 


gtcttgagtc 


caacccggta 


4200 


agacacgact 


"tatcgccact 


ggcagcagcc 


actggtaaca 


ggattagcag 


aacaaaatat 


4260 


gtaggcggtg 


ctacagag"t~t 


cttgaagtgg 


tggectaact 


acggctacac 


tagaaggaca 


4320 


gtatt'Cgg'ta 


trtnrnrtrt 


getgaageca 


g"t"Cacct"tcg 


gaaaaagagt 


tggtagctct 


4380 


"Cga"t ceggea 


aacaaaccac 


cgctggtagc 


ggtggttttt 


ttgtttgcaa 


gcagcagatt 


4440 


aegegcagaa 


aaaaaggatc 


tcaagaagat 


cctttgatct 


tttctacggg 


gtctgacget 


4500 


cagtggaacg 


aaaactcacg 


ttaagggatt 


ttggtcatga 


gattatcaaa 


aaggatcttc 


4560 


acctagatcc 


ttttaaatta 


aaaatgaagt 


tttaaatcaa 


tctaaagtat 


atatgagtaa 


4620 


acttgg'tctg 


acagttacca 


atgettaate 


agtgaggcac 


ctatctcagc 


gatctgtcta 


4680 


tttcgttcat 


ccatagttgc 


ctgactcccc 


gtcgtgtaga 


taactacgat 


aegggaggge 


4740 


ttaccatctg 


gccccagtgc 


tgcaatgata 


ccgcgagacc 


cacgctcacc 


ggctccagat 


4800 




taaaccagcc 


ageeggaagg 


gccgagcgca 


gaagtggtcc 


tgcaacttta 


4860 


Tccgcctcca 


tccagtctat 


"taattg'ttgc 


egggaagcta 


gagtaagtag 


"Ctcgccagtt 


4920 


aatagt'ttgc 


gcaacg*t"tg~£ 


tgccat'tgc't 


acaggca*tcg 


tggtgtcacg 


ctcgtcgttt 


4980 


ggtatggc't't 


cat *t cage tc 


cggttcccaa 


cgatcaaggc 


gagttacatg 


atcccccatg 


5040 


ttgtgcaaaa 


aagcggttag 


ctccttcggt 


cccccgatcg 


ttgtcagaag 


taagttggcc 


5100 


gcagtgttat 


cac~tca~tgg"C 


tatggcagca 


ctgeataatt 


ctcttactgt 


catgccatcc 


5160 


gtaagatget 


tttctgtgac 


tggtgagtac 


tcaaccaagt 


cattctgaga 


atagtgtatg 


5220 


cggcgaccga 


gttgctcttg 


cccggcgtca 


ataegggata 


ataccgcgcc 


acatagcaga 


5280 


actttaaaag 


tgctcatcat 


tggaaaacgt 


"tcttcaaaac 


gaaaactctc 


aaggatctta 


5340 


ccgctgttga 


gatccagttc 


gatgtaaccc 


actcgtgcac 


ccaactgatc 


ttcagcatct 


5400 


tttactttca 


ccagcgtftc 


tgggtgagca 


aaaacaggaa 


ggcaaaatgc 


cgcaaaaaag 


5460 


yyad Ldctyy y 


r c a r nna a 

LiJaLaLyyacl 


a"Cg"ttgaa"ta 


ctcatactct 


tcctttttca 


atattattga 


5520 


agcatttatc 


agggttattg 


tctcatgagc 


ggatacatat 


ttgaatgtat 


ttagaaaaat 


5580 


aaacaaatag 


gggttccgcg 


cacatttccc 


cgaaaagtgc 


cacctgacgt 


ctaagaaacc 


5640 


attattatca 


tgacattaac 


ctataaaaat 


aggegtatea 


cgaggccctt 


tcgtctcgcg 


5700 


cgtttcggtg 


atgacggtga 


aaacctctga 


cacatgcagc 
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tcccggagac 


ggtcacagct 


5760 



tgtctgtaag 


cggatgccgg 


gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc 


5820 


gggtgtcggg 


gctggcttaa 


ctatgcggca tcagagcaga ttgtactgag agtgcaccat 


5880 


atcgacgctc 


tcccttatgc 


gactcctgca ttaggaagca gcccagtagt aggttgaggc 


5940 


cgttgagcac 


cgccgccgca 


aggaatggtg catgcaagga gatggcgccc aacagtcccc 


6000 


rnnrrarnnn 

i -yy | - , - a, -yyy 


gcctgccacc 


atacccacgc cgaaacaagc gctcatgagc ccgaagtggc 


6060 


gagcccgatc 


ttccccatcg 


gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg 


6120 


caccaataat 


accaaccaco. 


atgcgtccgg cgtagaggat ctggctagcg atgaccctgc 


6180 


tgattggttc 


gctgaccatt 


tccggggtgc ggaacggcgt taccagaaac tcagaaggtt 


6240 


cgtccaacca 


aaccgactct 


gacggcagtt tacgagagag atgatagggt ctgcttcagt 


6300 


aagccagatg 


ctacacaatt 


aggcttgtac atattgtcgt tagaacgcgg ctacaattaa 


6360 


tacataacct 


tatgtatcat 


acacatacg 


6389 



<210> 3 

<211> 4608 

<212> DNA 

<213> Artificial 

<220> 

<223> Designed dna(C145a) based on protease originated from human 
coronavi rus 



<400> 3 
atttaggtga 


cactatagaa 


ctcacctatc tccccaacac 


ctaataacat 


tcaatcactc 


60 


tttccactaa 


ccacctatct 


acatcaccaa gatatcacta 


gttctcgaga 


tgagcggctt 


120 


ccgcaagatg 


gccttcccca 


gcggcaaggt cgagggctgc 


atggtgcagg 


tcacctgcgg 


180 


caccactacc 


ctgaacggcc 


tgtggctgga tgacaccgtc 


tactgccccc 


gccacgtgat 


240 


ctgcaccgcc 


gaggacatgc 


tgaaccccaa ctacgaggac 


ctgctcatcc 


gcaagagcaa 


300 


ccactccttc 


ctggtgcagg 


ccggcaacgt ccagctgcgc 


gtgatcggcc 


acagcatgca 


360 


gaactgcctg 


ctccgcctga 


aggtggacac cagcaacccc 


aagaccccca 


agtacaagtt 


420 


cgtgcgcatc 


cagcccggcc 


agaccttcag cgtgctggcc 


tgctacaacg 


gcagccccag 


480 


cggcgtgtac 


cagtgcgcca 


tgcgccccaa ccacaccatc 


aagggcagct 


tcctgaacgg 


540 


gagcgccggc 


agcgtgggct 


tcaacatcga ttacgactgc 


gtaagcttct 


gctacatgca 


600 


ccacatggag 


ctgcccaccg 


gcgtgcacgc cggcaccgac 


ctggagggca 


agttctacgg 


660 


ccccttcgtg 


gaccgccaga 


ccgcccaggc cgccggcacc 


gacaccacta 


tcaccctgaa 


720 


cgtgctggcc 


tggctgtacg 


ccgccgtgat caacggcgac 


cgctggttcc 


tgaaccgctt 


780 


caccactacc 


ctgaacgact 


tcaacctggt ggccatgaag 


tacaactacg 


agcccctgac 


840 


ccaggaccac 


gtggacatcc 


tgggccccct gagcgcccag 


accggcatcg 


ccgtcctgga 


900 
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catgtgcgcc 


gccctgaagg 


agctgctcca 


gaacggcatg 


aacggccgca 


ccatcctggg 


960 


cagcaccatc 


ctggaggacg 


agttcacccc 


cttcgacgtc 


gtgcgccagt 


gcagcggcgt 


1020 


gaccttccag 


taaggatcca 


tatatagggc 


ccgggttata 


attacctcag 


gtcgacgtcc 


1080 


catggttttg 


tatagaattt 


acggctagcg 


ccggatgcga 


cgccggtcgc 


gtcttatccg 


1140 


gccttcctat 


atcaggctgt 


gtttaagacg 


ccgccgcttc 


gcccaaatcc 


ttatgccggt 


1200 


tcgacggctg 


gacaaaatac 


tgtttatctt 


cccagcgcag 


gcaggttaat 


gtaccacccc 


1260 


agcagcagcc 


ggtatccagc 


gcgtatatac 


cttccggcgt 


acctttgccc 


tccagcgatg 


1320 


cccagtgacc 


aaaggcgatg 


ctgtattctt 


cagcgacagg 


gccaggaatc 


gcaaaccacg 


1380 


gtttcagtgg 


ggcaggggcc 


tcttccggcg 


attcttacta 


gctagtatgc 


ataggtgctg 


1440 


aaatataaag 


tttgtgtttc 


taaaacacac 


gtggtacgta 


cgataacgta 


cagtgttttt 


1500 


ccctccactt 


aaatcgaagg 


gtagtgtctt 


ggagcgcgcg 


gagtaaacat 


atatggttca 


1560 


tatatgtccg 


taggcacgta 


aaaaaagcga 


gggattcgaa 


ttcccccgga 


acccccggtt 


1620 


ggggcccacg 


cctcgatcga 


gcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaagctt 


1680 


ggcgtaatca 


tggtcatagc 


tgtttcctgt 


gtgaaattgt 


tatccgctca 


caattccaca 


1740 


caacatacga 


gccggaagca 


taaagtgtaa 


agcctggggt 


gcctaatgag 


tgagctaact 


1800 


cacattaatt 


gcgttgcgct 


cactgcccgc 


tttccagtcg 


ggaaacctgt 


cgtgccagct 


1860 


gcattaatga 


atcggccaac 


gcgcggggag 


aggcggtttg 


cgtattgggc 


gctcttccgc 


1920 


ttcctcgctc 


actgactcgc 


tgcgctcggt 


cgttcggctg 


cggcgagcgg 


tatcagctca 


1980 


ctcaaaggcg 


gtaatacggt 


tatccacaga 


atcaggggat 


aacgcaggaa 


agaacatgtg 


2040 


agcaaaaggc 


cagcaaaagg 


ccaggaaccg 


taaaaaggcc 


gcgttgctgg 


cgtttttcca 


2100 


taggctccgc 


ccccctgacg 


agcatcacaa 


aaatcgacgc 


tcaagtcaga 


ggtggcgaaa 


2160 


cccgacagga 


ctataaagat 


accaggcgtt 


tccccctgga 


agctccctcg 


tgcgctctcc 


2220 


tgttccgacc 


ctgccgctta 


ccggatacct 


gtccgccttt 


ctcccttcgg 


gaagcgtggc 


2280 


gctttctcat 


agctcacgct 


gtaggtatct 


cagttcggtg 


taggtcgttc 


gctccaagct 


2340 


gggctgtgtg 


cacgaacccc 


ccgttcagcc 


cgaccgctgc 


gccttatccg 


gtaactatcg 


2400 


tcttgagtcc 


aacccggtaa 


gacacgactt 


atcgccactg 


gcagcagcca 


ctggtaacag 


2460 


gattagcaga 


gcgaggtatg 


taggcggtgc 


tacagagttc 


ttgaagtggt 


ggcctaacta 


2520 


cggctacact 


agaaggacag 


tatttggtat 


ctgcgctctg 


ctgaagccag 


ttaccttcgg 


2580 


aaaaagagtt 


ggtagctctt 


gatccggcaa 


acaaaccacc 


gctggtagcg 


gtggtttttt 


2640 


tgtttgcaag 


cagcagatta 


cgcgcagaaa 


aaaaggatct 


caagaagatc 


ctttgatctt 


2700 


ttctacgggg 


tctgacgctc 


agtggaacga 


aaactcacgt 


taagggattt 


tggtcatgag 


2760 


attatcaaaa 


aggatcttca 


cctagatcct 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


2820 
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ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 2880 

tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 2940 

aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 3000 

acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 3060 

aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 3120 

agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta caggcatcgt 3180 

ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg 3240 

agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt 3300 

tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc 3360 

tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 3420 

attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa 3480 

taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg 3540 

aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc 3600 

caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag 3660 

gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 3720 

cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt 3780 

tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc 3840 

acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac 3900 

gaggcccttt cgtctcgcgc gtttcggtga tgacggtgaa aacctctgac acatgcagct 3960 

cccggagacg gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 4020 

cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat cagagcagat 4080 

tgtactgaga gtgcaccata tcgacgctct cccttatgcg actcctgcat taggaagcag 4140 

cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc atgcaaggag 4200 

atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc gaaacaagcg 4260 

ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg 4320 

ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc gtagaggatc 4380 

tggctagcga tgaccctgct gattggttcg ctgaccattt ccggggtgcg gaacggcgtt 4440 

accagaaact cagaaggttc gtccaaccaa accgactctg acggcagttt acgagagaga 4500 

tgatagggtc tgcttcagta agccagatgc tacacaatta ggcttgtaca tattgtcgtt 4560 

agaacgcggc tacaattaat acataacctt atgtatcata cacatacg 4608 

<210> 4 
<211> 5298 
<212> DNA 



<213> Artificial sequence 
<220> 

<223> plasmid 



<400> 4 
atttaggtga 


cactatagaa 


rtrarrtaf r 


t CCCCcL3.C3.C 


ctaataacat 


tcaatcactc 


60 


tttccactaa 


ccacctatct 


acat caccaa 


Q3."t3."tC3.C"tC 


gagatggaat 


cccctatact 


120 


aggttattgg 


aaaattaagg 


gccttgtgca 


3LCCC3.C"tCQ3. 


cttcttttgg 


aatat cttga 


180 


agaaaaatat 


gaagagcatt 


tgtatgagcg 


CQ3."tQ3.<3.CJCJ"t 


gataaatggc 


gaaacaaaaa 


240 


gtttgaattg 


ggtttggagt 


tt cccaatct 


trrttaf tat 
LL.L. L La L La L 


attgatggtg 


atgttaaatt 


300 


aacacagtct 


atggccatca 


tacgttatat 


agctgacaag 


cacaacatgt 


tyyy i_y y l uy 


360 


tccaaaagag 


cgtgcagaga 


tttcaatgct 


tgaaggagcg 


gttttggata 


ttagatacgg 


420 


tgtttcgaga 


attgcatata 


gtaaagactt 


tgaaactctc 


aaagttgatt 


ttcttagcaa 


480 


gctacctgaa 


atgctgaaaa 


tgttcgaaga 


tcgtttatgt 


cataaaacat 


atttaaatgg 


540 


tgatcatgta 


acccatcctg 


acttcatgtt 


gtatgacgct 


cttgatgttg 


ILILalaLdl. 


600 


ggacccaatg 


tgcctggatg 


cgttcccaaa 


attagtttgt 


tttaaaaaac 


gtattgaagc 


660 


tatcccacaa 


attgataagt 


acttgaaatc 


cagcaagtat 


atagcat ggc 


ctttgcaggg 


720 


ctggcaagcc 


acgtttggtg 


gtggcgacca 


tcctccaaaa 


tcggacccac 


cgcagaccag 


780 


catcacctct 


gccgtgctgc 


agagcggctt 


ccgcaagatg 


gcctt cccca 


gcggcaaggt 


840 


cgagggctgc 


atggtgcagg 


tcacctgcgg 


caccactacc 


ctgaacggcc 


tgtggctgga 


900 


tgacaccgtc 


tactgccccc 


gccacgtgat 


ctgcaccgcc 


gaggacatgc 


tgaaccccaa 


960 


ctacgaggac 


ctgctcatcc 


gcaagagcaa 


ccact ccttc 


ctggtgcagg 


ccggcaacgt 


1020 


ccagctgcgc 


gtgatcggcc 


acagcatgca 


gaactgcctg 


ctccgcctga 


aggtggacac 


1080 


cagcaacccc 


aagaccccca 


agtacaagtt 


cgtgcgcatc 


cagcccggcc 


agaccttcag 


1140 


cgtgctggcc 


tgctacaacg 


gcagccccag 


cggcgtgtac 


cagtgcgcca 


tgcgccccaa 




ccacaccatc 


aagggcagct 


tcctgaacgg 


gagctgcggc 


agcgtgggct 


tcaacatcga 


1260 


ctacgactgc 


gtaagcttct 


gctacatgca 


ccacaTggag 




gcgtgcacgc 


1320 


cggcaccgac 


ctggagggca 


agttctacgg 


ccccttcgtg 


gaccgccaga 


ccgcccaggc 


1380 


cgccggcacc 


gacaccacta 


tcaccctgaa 


cgtgctggcc 


tggctgtacg 


ccgccgtgat 




caacggcgac 


cgctggttcc 


tgaaccgctt 


caccactacc 


ctgaacgact 


tcaacctggt 


1500 


ggccatgaag 


tacaactacg 


agcccctgac 


ccaggaccac 


gtggacatcc 


tgggccccct 


1560 


gagcgcccag 


accggcatcg 


ccgtcctgga 


catgtgcgcc 


gccctgaagg 


agctgctcca 


1620 


gaacggcatg 


aacggccgca 


ccatcctggg 


cagcaccatc 


ctggaggacg 


agttcacccc 


1680 


cttcgacgtc 


gtgcgccagt 


gcagcggcgt 


gaccttccag 


taaggatcca 


tatatagggc 


1740 
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ccgggttata attacctcag gtcgacgtcc catggttttg tatagaattt acggctagcg 1800 

ccggatgcga cgccggtcgc gtcttatccg gccttcctat atcaggctgt gtttaagacg 1860 

ccgccgcttc gcccaaatcc ttatgccggt tcgacggctg gacaaaatac tgtttatctt 1920 

cccagcgcag gcaggttaat gtaccacccc agcagcagcc ggtatccagc gcgtatatac 1980 

cttccggcgt acctttgccc tccagcgatg cccagtgacc aaaggcgatg ctgtattctt 2040 

cagcgacagg gccaggaatc gcaaaccacg gtttcagtgg ggcaggggcc tcttccggcg 2100 

attcttacta gctagtatgc ataggtgctg aaatataaag tttgtgtttc taaaacacac 2160 

gtggtacgta cgataacgta cagtgttttt ccctccactt aaatcgaagg gtagtgtctt 2220 

ggagcgcgcg gagtaaacat atatggttca tatatgtccg taggcacgta aaaaaagcga 2280 

gggattcgaa ttcccccgga acccccggtt ggggcccacg cctcgatcga gcaaaaaaaa 2340 

aaaaaaaaaa aaaaaaaaaa aaaaaagctt ggcgtaatca tggtcatagc tgtttcctgt 2400 

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 2460 

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 2520 

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 2580 

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 2640 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 2700 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 2760 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 2820 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 2880 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 2940 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 3000 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 3060 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 3120 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 3180 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 3240 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 3300 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 3360 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 3420 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 3480 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 3540 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 3600 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 3660 

11 



ccccagtgct 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


gctccagatt 


tatcagcaat 


3720 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


gcaactttat 


ccgcctccat 


3780 


ccagtctatt 


aattgttgcc 


gggaagctag 


agtaagtagt 


tcgccagtta 


atagtttgcg 


3840 


caacgttgtt 


gccattgcta 


caggcatcgt 


ggtgtcacgc 


tcgtcgtttg 


gtatggcttc 


3900 


attcagctcc 


ggttcccaac 


gatcaaggcg 


agttacatga 


tcccccatgt 


tgtgcaaaaa 


3960 


agcggttagc 


tccttcggtc 


ctccgatcgt 


tgtcagaagt 


aagttggccg 


cagtgttatc 


4020 


actcatggtt 


atggcagcac 


tgcataattc 


tcttactgtc 


atgccatccg 


taagatgctt 


4080 


ttctgtgact 


ggtgagtact 


caaccaagtc 


attctgagaa 


tagtgtatgc 


ggcgaccgag 


4140 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


taccgcgcca 


catagcagaa 


ctttaaaagt 


4200 


gctcatcatt 


ggaaaacgtt 


cttcggggcg 


aaaactctca 


aggatcttac 


cgctgttgag 


4260 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


caactgatct 


tcagcatctt 


ttactttcac 


4320 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


gcaaaatgcc 


gcaaaaaagg 


gaataagggc 


4380 


gacacggaaa 


tgttgaatac 


tcatactctt 


cctttttcaa 


tattattgaa 


gcatttatca 


4440 


gggttattgt 


ctcatgagcg 


gatacatatt 


tgaatgtatt 


tagaaaaata 


aacaaatagg 


4500 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


acctgacgtc 


taagaaacca 


ttattatcat 


4560 


gacattaacc 


tataaaaata 


ggcgtatcac 


gaggcccttt 


cgtctcgcgc 


gtttcggtga 


4620 


tgacggtgaa 


aacctctgac 


acatgcagct 


cccggagacg 


gtcacagctt 


gtctgtaagc 


4680 


ggatgccggg 


agcagacaag 


cccgtcaggg 


cgcgtcagcg 


ggtgttggcg 


ggtgtcgggg 


4740 


ctggcttaac 


tatgcggcat 


cagagcagat 


tgtactgaga 


gtgcaccata 


tcgacgctct 


4800 


cccttatgcg 


actcctgcat 


taggaagcag 


cccagtagta 


ggttgaggcc 


gttgagcacc 


4860 


gccgccgcaa 


ggaatggtgc 


atgcaaggag 


atggcgccca 


acagtccccc 


ggccacgggg 


4920 


cctgccacca 


tacccacgcc 


gaaacaagcg 


ctcatgagcc 


cgaagtggcg 


agcccgatct 


4980 


tccccatcgg 


tgatgtcggc 


gatataggcg 


ccagcaaccg 


cacctgtggc 


gccggtgatg 


5040 


ccggccacga 


tgcgtccggc 


gtagaggatc 


tggctagcga 


tgaccctgct 


gattggttcg 


5100 


ctgaccattt 


ccggggtgcg 


gaacggcgtt 


accagaaact 


cagaaggttc 


gtccaaccaa 


5160 


accgactctg 


acggcagttt 


acgagagaga 


tgatagggtc 


tgcttcagta 


agccagatgc 


5220 


tacacaatta 


ggcttgtaca 


tattgtcgtt 


agaacgcggc 


tacaattaat 


acataacctt 


5280 


atgtatcata 


cacatacg 










5298 



<212> DNA 

<213> Artificial sequence 



<220> 

<223> plasmid 
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atttaggtga cactatagaa ctcacctatc tccccaacac ctaataacat tcaatcactc 60 

tttccactaa ccacctatct acatcaccaa gatatcactc gagcatggtg agcaagggcg 120 

aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtgaacggcc 180 

acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga 240 

agttcatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccttca 300 

cctacggcgt gcagtgcttc agccgctacc ccgaccacat gaagcagcac gacttcttca 360 

agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca 420 

actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc 480 

tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact 540 

acaacagcca caacgtctat atcatggccg acaagcagaa gaacggcatc aaggtgaact 600 

tcaagatccg ccacaacatc gaggacggca gcgtgcagct cgccgaccac taccagcaga 660 

acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agcacccagt 720 

ccgccctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga 780 

ccgccgccgg gatcactcac ggcatggacg agctgtacaa gcccccccag accagcatca 840 

cctctgccgt gctgcagagc ggcttccgca agatggcctt ccccagcggc aaggtcgagg 900 

gctgcatggt gcaggtcacc tgcggcacca ctaccctgaa cggcctgtgg ctggatgaca 960 

ccgtctactg cccccgccac gtgatctgca ccgccgagga catgctgaac cccaactacg 1020 

aggacctgct catccgcaag agcaaccact ccttcctggt gcaggccggc aacgtccagc 1080 

tgcgcgtgat cggccacagc atgcagaact gcctgctccg cctgaaggtg gacaccagca 1140 

accccaagac ccccaagtac aagttcgtgc gcatccagcc cggccagacc ttcagcgtgc 1200 

tggcctgcta caacggcagc cccagcggcg tgtaccagtg cgccatgcgc cccaaccaca 1260 

ccatcaaggg cagcttcctg aacgggagct gcggcagcgt gggcttcaac atcgactacg 1320 

actgcgtaag cttctgctac atgcaccaca tggagctgcc caccggcgtg cacgccggca 1380 

ccgacctgga gggcaagttc tacggcccct tcgtggaccg ccagaccgcc caggccgccg 1440 

gcaccgacac caccatcacc ctgaacgtgc tggcctggct gtacgccgcc gtgatcaacg 1500 

gcgaccgctg gttcctgaac cgcttcacca ctaccctgaa cgacttcaac ctggtggcca 1560 

tgaagtacaa ctacgagccc ctgacccagg accacgtgga catcctgggc cccctgagcg 1620 

cccagaccgg catcgccgtc ctggacatgt gcgccgccct gaaggagctg ctccagaacg 1680 

gcatgaacgg ccgcaccatc ctgggcagca ccatcctgga ggacgagttc acccccttcg 1740 

acgtcgtgcg ccagtgcagc ggcgtgacct tccagtaagg atccatatat agggcccggg 1800 

ttataattac ctcaggtcga cgtcccatgg ttttgtatag aatttacggc tagcgccgga 1860 
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tgcgacgccg 


gtcgcgtctt 


atccggcctt 


cctatatcag 


gctgtgttta 


agacgccgcc 


1920 


gcttcgccca 


aatccttatg 


ccggttcgac 


ggctggacaa 


aatactgttt 


atcttcccag 


1980 


cgcaggcagg 


ttaatgtacc 


accccagcag 


cagccggtat 


ccagcgcgta 


tataccttcc 


2040 


ggcgtacctt 


tgccctccag 


cgatgcccag 


tgaccaaagg 


cgatgctgta 


ttcttcagcg 


2100 


acagggccag 


gaatcgcaaa 


ccacggtttc 


agtggggcag 


gggcctcttc 


cggcgattct 


2160 


tactagctag 


tatgcatagg 


tgctgaaata 


taaagtttgt 


gtttctaaaa 


cacacgtggt 


2220 


acgtacgata 


acgtacagtg 


tttttccctc 


cacttaaatc 


gaagggtagt 


gtcttggagc 


2280 


gcgcggagta 


aacatatatg 


gttcatatat 


gtccgtaggc 


acgtaaaaaa 


agcgagggat 


2340 


tcgaattccc 


ccggaacccc 


cggttggggc 


ccacgcctcg 


atcgagcaaa 


aaaaaaaaaa 


2400 


aaaaaaaaaa 


aaaaaaaaaa 


agcttggcgt 


aatcatggtc 


atagctgttt 


cctgtgtgaa 


2460 


attgttatcc 


gctcacaatt 


ccacacaaca 


tacgagccgg 


aagcataaag 


tgtaaagcct 


2520 


ggggtgccta 


atgagtgagc 


taactcacat 


taattgcgtt 


gcgctcactg 


cccgctttcc 


2580 


agtcgggaaa 


cctgtcgtgc 


cagctgcatt 


aatgaatcgg 


ccaacgcgcg 


gggagaggcg 


2640 


gtttgcgtat 


tgggcgctct 


tccgcttcct 


cgctcactga 


ctcgctgcgc 


tcggtcgttc 


2700 


ggctgcggcg 


agcggtatca 


gctcactcaa 


aggcggtaat 


acggttatcc 


acagaatcag 


2760 


gggataacgc 


aggaaagaac 


atgtgagcaa 


aaggccagca 


aaaggccagg 


aaccgtaaaa 


2820 


aggccgcgtt 


gctggcgttt 


ttccataggc 


tccgcccccc 


tgacgagcat 


cacaaaaatc 


2880 


gacgctcaag 


tcagaggtgg 


cgaaacccga 


caggactata 


aagataccag 


gcgtttcccc 


2940 


ctggaagctc 


cctcgtgcgc 


tctcctgttc 


cgaccctgcc 


gcttaccgga 


tacctgtccg 


3000 


cctttctccc 


ttcgggaagc 


gtggcgcttt 


ctcatagctc 


acgctgtagg 


tatctcagtt 


3060 


cggtgtaggt 


cgttcgctcc 


aagctgggct 


gtgtgcacga 


accccccgtt 


cagcccgacc 


3120 


gctgcgcctt 


atccggtaac 


tatcgtcttg 


agtccaaccc 


ggtaagacac 


gacttatcgc 


3180 


cactggcagc 


agccactggt 


aacaggatta 


gcagagcgag 


gtatgtaggc 


ggtgctacag 


3240 


agttcttgaa 


gtggtggcct 


aactacggct 


acactagaag 


gacagtattt 


ggtatctgcg 


3300 


ctctgctgaa 


gccagttacc 


ttcggaaaaa 


gagttggtag 


ctcttgatcc 


ggcaaacaaa 


3360 


ccaccgctgg 


tagcggtggt 


ttttttgttt 


gcaagcagca 


gattacgcgc 


agaaaaaaag 


3420 


gatctcaaga 


agatcctttg 


atcttttcta 


cggggtctga 


cgctcagtgg 


aacgaaaact 


3480 


cacgttaagg 


gattttggtc 


atgagattat 


caaaaaggat 


cttcacctag 


atccttttaa 


3540 


attaaaaatg 


aagttttaaa 


tcaatctaaa 


gtatatatga 


gtaaacttgg 


tctgacagtt 


3600 


accaatgctt 


aatcagtgag 


gcacctatct 


cagcgatctg 


tctatttcgt 


tcatccatag 


3660 


ttgcctgact 


ccccgtcgtg 


tagataacta 


cgatacggga 


gggcttacca 


tctggcccca 


3720 


gtgctgcaat 


gataccgcga 


gacccacgct 


caccggctcc 


agatttatca 


gcaataaacc 


3780 
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agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 3840 

ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 3900 

ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca 3960 

gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg 4020 

ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca 4080 

tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg 4140 

tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct 4200 

cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca 4260 

tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca 4320 

gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg 4380 

tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 4440 

ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt 4500 

artgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc 4560 

cgcgcacatt tccccgaaaa gtgccacctg acgtctaaga aaccattatt atcatgacat 4620 

taacctataa aaataggcgt atcacgaggc cctttcgtct cgcgcgtttc ggtgatgacg 4680 

gtgaaaacct ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg 4740 

ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggctggc 4800 

ttaactatgc ggcatcagag cagattgtac tgagagtgca ccatatcgac gctctccctt 4860 

atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc 4920 

cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 4980 

caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc 5040 

atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc 5100 

cacgatgcgt ccggcgtaga ggatctggct agcgatgacc ctgctgattg gttcgctgac 5160 

catttccggg gtgcggaacg gcgttaccag aaactcagaa ggttcgtcca accaaaccga 5220 

ctctgacggc agtttacgag agagatgata gggtctgctt cagtaagcca gatgctacac 5280 

aattaggctt gtacatattg tcgttagaac gcggctacaa ttaatacata accttatgta 5340 

tcatacacat acg 5353 

<210> 6 
<211> 115 
<212> DNA 
<213> Artificial 



<400> 6 



cggcatgaac ggccgcacca tcctgggcag caccatcctg gaggacgagt tcaccccctt 60 
cgacgtcgtg cgccagtgca gcggcgtgac cttccagtaa ggatccacta gttct 115 



<210> 7 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 7 

ctgaaggagc tgctccagaa cggcatgaac ggccg 35 

<210> 8 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 8 

catgaagtac aactacgagc ccctgaccca ggaccacgtg gacatcctgg gccccctgag 60 
cgcccagacc ggcatcgccg tcctggacat gtgcgccgcc ctgaaggagc tgctc 115 

<210> 9 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 9 

aacgacttca acctggtggc catgaagtac aacta 35 

<210> 10 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 10 

cggcaccgac accaccatca ccctgaacgt gctggcctgg ctgtacgccg ccgtgatcaa 60 
cggcgaccgc tggttcctga accgcttcac cactaccctg aacgacttca acctg 115 

<210> 11 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
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<400> 11 

cgccagaccg cccaggccgc cggcaccgac accac 



35 



<210> 12 

<211> 99 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 

<400> 12 „ 
tgctacatgc accacatgga gctgcccacc ggcgtgcacg ccggcaccga cctggagggc bo 

aagttctacg gccccttcgt ggaccgccag accgcccag 99 

<210> 13 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 

<400> 13 -., 
actacgactg cgtaagcttc tgctacatgc accac « 

<210> 14 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 

<400> 14 „ 
ccagccacag gccgttcagg gtagtggtgc cgcaggtgac ctgcaccatg cagccctcga 60 

ccttgccgct ggggaaggcc atcttgcgga agccgctcat ctcgaggggg ggccc 115 

<210> 15 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 

<400> 15 

ggggcagtag acggtgtcat ccagccacag gccgt " 

<210> 16 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 



<400> 16 

cgttgccggc ctgcaccagg aaggagtggt tgctcttgcg gatgagcagg tcctcgtagt 60 
tggggttcag catgtcctcg gcggtgcaga tcacgtggcg ggggcagtag acggt 115 

<210> 17 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SAR5 protease gene 
<400> 17 

gccgatcacg cgcagctgga cgttgccggc ctgca 35 

<210> 18 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 18 

tgaaggtctg gccgggctgg atgcgcacga acttgtactt gggggtcttg gggttgctgg 60 
tgtccacctt caggcggagc aggcagttct gcatgctgtg gccgatcacg cgcag 115 

<210> 19 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 19 

gttgtagcag gccagcacgc tgaaggtctg gccgg 35 

<210> 20 

<211> 115 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer for cloning SARS protease gene 
<400> 20 

cgatgttgaa gcccacgctg ccgcagctcc cgttcaggaa gctgcccttg atggtgtggt 60 
tggggcgcat ggcgcactgg tacacgccgc tggggctgcc gttgtagcag gccag 115 

<210> 21 

<211> 37 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer for cloning SARS protease gene 
<400> 21 

gagaagctta cgcagtcgta gtcgatgttg aagccca 



<210> 22 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Spu primer 

<400> 22 

gcgtagcatt taggtgacac t 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> AODA2303 primer 

<400> 23 

gtcagacccc gtagaaaaga 



<210> 24 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> GFP-RS-GUS (EOl-XhoI-Al) 

<400> 24 

gagactcgag tgatatcttg gtgatgtag 



<210> 25 

<211> 69 

<212> DNA 

<213> Artificial 

<220> 

<223> GFP-RS-GUS (GUS-RS-Sl) 



<400> 25 

gagactgcag agcggcttcc gcaagatggc cttccccagc ggcaaggtga tgttacgtcc 60 



tgtagaaac 



<210> 26 

<211> 30 

<212> DNA 

<213> Artificial 



69 



<220> 

<223> GFP-RS-GUS CGFP-XhoI-Sl) 
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<400> 26 

gagactcgag aatggtgagc aagggcgagg 



30 



<210> 27 

<211> 59 

<212> DNA 

<213> Artificial 

<220> 

<223> GFP-RS-GU5 (GFP-RS-Al) 

<400> 27 

gagactgcag cacggcagag gtgatgctgg tctggggggg cttgtacagc tcgtccatg 

<210> 28 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> GFP-RS-SA3CL pro (RS-3CL-S1) 

<400> 28 

gagactgcag agcggcttcc gcaagatggc 

<210> 29 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> GFP-RS-SA3CL pro (Ml3) 

<400> 29 

gtaaaacgac ggccagt 

<210> 30 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> GST-RS-SA3CL pro (GST-RS-sen) 

<400> 30 

gagactcgag atggaatccc eta 

<210> 31 

<211> 59 

<212> DNA 

<213> Artificial 

<220> 

<223> GST-RS-SA3CL pro (GST-RS-anti) 

<400> 31 

gagactgcag cacggcagag gtgatgctgg tctgcggtgg gtccgatttt ggaggatgg 



<210> 32 

<211> 306 

<212> PRT 

<213> SARS-COV 

<400> 32 

Ser Giy Phe Arg Lys Met Ala Phe Pro ser Gly Lys val Glu Giy Cys 
15 10 15 

Met val Gin val Thr Cys Gly Thr Thr Thr Leu Asn Gly Leu Trp Leu 
20 25 30 

Asp Asp Thr val Tyr Cys Pro Arg His val lie Cys Thr Ala Glu Asp 
35 40 45 

Met Leu Asn Pro Asn Tyr Glu Asp Leu Leu lie Arg Lys Ser Asn His 
50 55 60 

ser Phe Leu val Gin Ala Gly Asn val Gin Leu Arg val He Gly His 
65 70 75 80 

ser Met Gin Asn Cys Leu Leu Arg Leu Lys val Asp Thr ser Asn Pro 
85 90 95 

Lys Thr Pro Lys Tyr Lys Phe val Arg lie Gin Pro Gly Gin Thr Phe 
100 105 110 

ser val Leu Ala Cys Tyr Asn Gly ser Pro Ser Gly val Tyr Gin Cys 
115 120 125 

Ala Met Arg Pro Asn His Thr lie Lys Gly Ser Phe Leu Asn Gly Ser 
130 135 140 

Cys Gly Ser Val Gly Phe Asn lie Asp Tyr Asp Cys val ser Phe Cys 
145 150 155 160 

Tyr Met His His Met Glu Leu Pro Thr Gly Val His Ala Gly Thr Asp 
165 170 175 

Leu Glu Gly Lys Phe Tyr Gly Pro Phe val Asp Arg Gin Thr Ala Gin 
180 185 190 

Ala Ala Gly Thr Asp Thr Thr He Thr Leu Asn Val Leu Ala Trp Leu 
195 200 205 

Tyr Ala Ala val He Asn Gly Asp Arg Trp Phe Leu Asn Arg Phe Thr 
210 215 220 
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Thr Thr Leu Asn Asp Phe Asn Leu val Ala Met Lys Tyr Asn Tyr Glu 
225 230 235 240 

Pro Leu Thr Gin Asp His Val Asp lie Leu Gly Pro Leu ser Ala Gin 
245 250 255 

Thr Gly lie Ala Val Leu Asp Met Cys Ala Ala Leu Lys Glu Leu Leu 
260 265 270 

Gin Asn Gly Met Asn Gly Arg Thr lie Leu Gly ser Thr lie Leu Glu 
275 280 285 

Asp Glu Phe Thr Pro Phe Asp val val Arg Gin cys ser Gly val Thr 
290 295 300 

Phe Gin 
305 

<210> 33 

<211> 25 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> RS Cleavage Sequence 



Pro Pro Gin Thr Ser lie Thr Ser Ala val Leu Gin Ser Gly Phe Arg 
15 10 15 

Lys Met Ala Phe Pro Ser Gly Lys Val 
20 25 
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